[0001] TITLE OF THE INVENTION 

REAL-TIME CONTENTS EDITING METHOD, SYSTEM, AND PROGRAM 

BACKGROUND OF THE INVENTION 
5 1. Field of the Invention 

[0002] The present invention relates to a real- time 

contents editing method, system, and program which are 
adapted to edit a large number of images, including live 
images, and/or voices which are present in a disposed fashion 

10 on the Internet and to distribute the edited images and 
voices to a plurality of users . 

2. Description of the Related Art 
[0003] Broadband networks, which have prevailed in 

recent years, enable distribution of moving-picture contents 

15 of large volumes. Since 2001, a great increase in the number 
of ADSL (Asymmetric Digital Subscriber Line) subscribers has 
been paving the way for a broadband environment . At the 
present time (as of March 2002), in Japan, 7.3% of all 
households subscribe to ADSL or CATV, and can receive data at 

20 a rate of about 1 to 10 Mbps and send data at a rate of about 
several hundreds kbps . Further, FTTH (Fiber To The Home) 
promises to prevail in the future; this is expected to enable 
communications at up to 100 Mbps (maximum speed) , as well as 
exchange (transmission and reception) of bulk moving-picture 

25 data between users. 

[0004] Consequently, many services and tools for 

distributing images and voices via a broadband network have 
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been proposed. To this end, high-quality image distribution 
systems utilizing broadband; e.g.. Ruff Systems 
( Publicationl , http : / /www . end . tel . co . jp/product /syo64 . html ) 
and DVTS (Publication2 , http: //www. sfc. wide. ad. jp/DVTS) , have 
5 been proposed. Ruff Systems is a system that distributes, on 
TCP/IP, non-compressed Dl/HDTV pictures and DV streams, the 
streams being taken by video cameras. Ruff Systems requires 
a wide band, because non- compressed images and DV images are 
used, but can distribute images in high quality. Meanwhile, 

10 many existing systems and tools utilize the MPEG moving 

picture format or the H.323 standard, which copes with narrow 
bands (see the following publications 3 to 6). These systems 
and tools (applications) enable a personal computer to 
receive, via the Internet, images taken by a camera. However, 

15 these applications are designed to handle a single moving 

picture, and therefore, are not suitable for handling a large 
number of moving pictures. 
[0005] Publication 3: 

ohphone : http : //www. openh323 . org/ docs /ohphone_man . html 
20 Publication 4: 

FFmpeg : ht tp : / /f f mpeg . sourcef orge . net/ 
Publication 5: 

http : / /www. tele . ucl . ac . be/PEOPLE/DOUXCHAMPS/ieeel394/coriande 
r/ 

25 Publication 6: 

"Experiments for Real-time MPEG- 4 Visual Encoder" by Yasuyuki 
Miura and Michitetsu Katsumoto, Research Paper of Information 
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Processing Society of Japan, 2002-DPS-100 , pp. 25-30, 2002. 



SUMMARY OF THE INVENTION 

[0006] In order to realize live distribution in which 

5 pictures taken by a video camera are distributed in real time, 
a coding process must be carried out as a real-time process. 
The real-time process must respond to a continuous external 
input and send back an output within a determined period of 
time. In a real-time processing system which distributes f 

10 frames of a moving picture per second, an editing process and 
packetization of a bit stream for each frame must be 
completed within an average of 1/f second, so that the 
resulting moving pictures can be distributed in real time. 
[0007] The following approaches are effective for 

15 achieving the above -described object. 

• A coding process is accelerated in order to complete 
the coding process within a limited period of time. 

• In order to ensure proper operation on machines which 
differ in level of performance, a time required to execute a 
20 process is monitored and, on the basis of the result of the 
monitoring, a suitable coding algorithm is selected from a 
plurality of coding algorithms. 

• A coding process is carried out in a step-by-step 
manner. After lapse of a predetermined time, required 

25 minimum coding is completed, whereupon an auxiliary coding 
process of enhanced resolution and compression ratio is 
carried out. In the event the auxiliary coding process is 
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not completed within a limited period of time, the auxiliary 
coding process is interrupted, and the result of the coding 
process in an immediately preceding step is distributed. 
[0008] On the basis of the above -described concept, in a 

5 real-time contents editing system of the present invention, a 
time required for coding of VOPs (Video Object Planes, which 
correspond to frames) is measured, and determination as to 
whether inter-frame compression is to be performed is made on 
the basis of the measured time and a predetermined average 

10 frame rate. 

[0009] The present inventors propose a real-time 

contents editing system which provides, for distribution 
systems dealing with individual streams, an operation for 
receiving a large number of images and voices as input data, 

15 and for editing, coding, and distributing the input data in 
real time. The present system codes and decodes MPEG- 4 video 
objects (VOs) and editing language codes in a distributed 
environment and has the following advantageous features over 
the conventional distribution systems . 

20 [0010] (1) Contents editing work using live images from 

a plurality of remote places is possible. 

(2) An image sender or receiver at each place can 
freely log on or off the system while the system is in 
operation. 

25 [0011] In order to provide contents suitable to 

individual users needs, many video objects (VOs) are sent to 
the user side to construct the contents. Therefore, a large 
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number of video objects (VOs) are concentrated on the user 
side. Consequently, allocating numerous frequency bands to 
an individual video object (VO) is difficult. Further, when 
a sender uses an asymmetrical communication service such as 
5 ADSL, the upstream bandwidth is narrower than the downstream 
bandwidth. Therefore, allocating numerous frequency bands to 
video objects (VOs) is still difficult. If a scheme adapted 
to transmit bulk data such as Dl non-compressed data or DV 
compressed data (e.g.. Ruff Systems) is adopted, a wide 

10 frequency band becomes necessary. Therefore, a system in 

which a large number of video objects (VOs) are sent/received 
concurrently requires a huge amount of frequency bands . 
Consequently, video objects (VOs) must be coded into MPEG-4 
video objects (MPEG-4 VOs), which have a high compression 

15 ratio and are adapted to various frequency bands. However, 
MPEG-4 VO coding requires a large CPU power for inter -frame 
compression, and, therefore, complex coding is difficult to 
perform in the case where coding of live images is performed 
by use of a low-performance machine in a real-time 

20 environment subjected to a strict restriction on time. For 
example, when a personal user sends a live image from his or 
her home, or during a business trip via portable equipment, 
in some cases only a home-use personal computer or similar 
equipment can be used. In contrast, when a relatively high 

25 performance server machine is used, more complex coding is 
possible as compared with the case where a low- performance 
machine is used. Thus, the range of performable processing 



5 



depends on the performance of a server machine to be used. 
[0012] In view of the above -described problems, an 

object of the present invention is to realize real time 
coding of video objects (VO) into MPEG-4 video objects (VOs) 
5 through execution of a coding process corresponding to CPU 
power . 

[0013] The present invention provides a real-time 

contents editing system which receives a plurality of images 
and/or voices as input data, and integrally performs coding 
10 to MPEG-4 video objects (VOs) # editing, and distribution in 
real time. The present system enables provision of contents 
from a plurality of remote locations, while using live images 
captured by use of a digital video camera. 

[0014] The present system is constituted by three kinds 

15 of modules; i.e., an editing module, distribution modules, 
and receiving modules. Each distribution module and each 
receiving module can freely be connected to or disconnected 
from the system while the system is in operation. Further, 
the distribution module can automatically select a coding 
20 method that enables high efficiency compression, by changing 
the type and use frequency of VOP to be employed in 
accordance with the performance level of a machine to be used. 
[0015] The real-time contents editing method, system, 

and program according to the present invention are adapted to 
25 edit a large number of images, including live images, and/ or 
voices which are present in a dispersed fashion on the 
Internet, and to distribute the edited images and/or voices 



6 



to a plurality of users. The method, system, and program 
according to the present invention are characterized by 
provision of a plurality of video cameras each serving as an 
input device, a plurality of distribution modules each 
5 adapted to code an input image taken by a corresponding video 
camera, by use of a coding standard which enables coding 
while selecting one of a plurality of coding algorithms and 
to distribute the coded input image, a plurality of receiving 
modules each adapted to receive and display the distributed 
10 image, and at least one editing module. Each distribution 

module changes, in accordance with the performance level of a 
machine to be used, the kind and use frequency of a video 
object plane (VOP) to be used, to thereby select a coding 
algorithm which enables highly efficient compression. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0016] FIG. 1 is a conceptual diagram of a real-time 

contents editing system according to an embodiment of the 
present invention; 
20 FIG. 2 is a block diagram showing the configuration of 

a converter; 

FIG. 3 is a flowchart illustrating a coding process in 
a distribution module; and 

FIG. 4 is a view showing an algorithm for determining 
25 appropriate N B vop and Np V op* 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
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[0017] Construction of real-time contents editing system> 

A real-time contents editing system is adapted to edit 
a large number of images , including live images , and voices , 
which are present in a dispersed fashion on the Internet, and 
5 to distribute the edited images and voices to a plurality of 
users. The present system is intend to operate irrespective 
of the kind of a communication network and machine to be 
employed, to thereby flexibly cope with a variety of existing 
environments. Consequently, in the present system, the MPEG- 
10 4 standard, which enables high compression ratio and can cope 
with various frequency bands, is used in coding images. 
Further, a scene description language is used for editing a 
plurality of images and voices. 

[0018] FIG. 1 is a conceptual diagram of a real-time 

15 contents editing system according to an embodiment of the 

present embodiment. The present system includes a plurality 
of video cameras each serving as an input device; a plurality 
of distribution modules each adapted to code input images, 
which are taken by the associated video camera, and to 
20 distribute the coded images; a plurality of receiving modules 
each adapted to receive and display the images; and at least 
one editing module. Further, in the illustrated system, a 
single distribution server, a single editing server, and two 
clients are interconnected via the Internet. The editing 
25 module is disposed in the editing server. One distribution 
module is disposed in the distribution server, and another 
distribution module is disposed in the editing server. One 



receiving module is disposed in each of the clients. One of 
the clients includes a distribution module in addition to the 
receiving module. In FIG. 1, DV (Digital Video) streams, 
MPEG-4 video objects (VOs) , multicast requests, input/output 
5 requests, and scene description are respectively designated 
by different kinds of arrows . 

[0019] Pursuant to an instruction from the operator, the 

editing module requests the distribution server to multicast 
the images and voices. Further, the editing module generates 

10 and multicasts a scene description language to be sent to the 
clients. The distribution module codes input images and 
voices, which are taken in the DV (Digital Video) format from 
an input device, into MPEG-4 image-voice data and multicasts 
the image-voice data to a plurality of clients via the 

15 Internet by means of personalized media stream distribution. 
In each client having a receiving module, moving pictures are 
displayed on the basis of a plurality of kinds of MPEG-4 
image- voice data and a scene description. Each video camera 
and a corresponding apparatus including a distribution module 

20 are interconnected via an IEEE1394 port. IEEE1394 ports are 
widely used in many kinds of DV equipment, and support 
isochronous transmission in which data is sent/received at 
high speed and at constant intervals. Therefore, IEEE1394 
ports are suitable for sending/receiving image and voice data. 

25 [0020] Each of the distribution module, the receiving 

module, and the editing module may assume an arbitrary form. 
For example, each of the distribution module, the receiving 
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module, and the editing module may be disposed as a program 
on a general -purpose server machine or may be operated on a 
home -use personal computer. Further, each of the 
distribution module, the receiving module, and the editing 
5 module may assume the form of a dedicated PC card and serve 
as an add-on system for a personal computer. 
<DV-MPEG converter in distribution module> 
1) Configuration 

[0021] In a real-time-processable contents editing 

10 system, a sender can distribute an image, while using a 

digital video camera and also using a personal computer at 
hand as a distribution server. Alternatively, the sender can 
distribute an image from a meeting room of the sender s 
office or any other place, while using a high-performance 

15 server machine. In the former case, occasionally a lower- 
performance personal computer is used as compared with that 
used in the latter case and, therefore, a complex coding 
process is difficult to execute. In the latter case, a more 
complex coding process can be executed, but the type of 

20 usable coding depends on the specifications of a server 

machine to be used. Further, the type of usable coding is 
restricted by the frequency band of a communication line at 
the sender side. If an asymmetrical line such as ADSL is 
used on the transmission side, the upstream transmission 

25 speed is several hundreds kbps, which is considerably slow as 
compared with the downstream transmission speed. For these 
reasons, coding suitable for the environment at the sender 



side must be performed. 

[0022] On many occasions, at least a predetermined frame 

rate must be maintained in order to maintain quality of 
moving pictures. Consequently, the present system is assumed 
to be subjected to a constraint of maintaining a frame rate 
higher than R (frames/s) (R: a predetermined number). 
[0023] In the case where coding is executed with use of 

only I-VOP, motion compensation relevant to inter- frame 
compression is unnecessary and, therefore, images that have 
been coded immediately previously are unnecessary. As a 
consequence, inverse quantization and inverse DCT (discrete 
cosine transformation) for coded images can be omitted and, 
therefore, the coding time can be shortened considerably (see 
above-mentioned publication 6). However, this raises a 
problem that coding increases the quantity of codes, and a 
form of coding is fixed; i.e., coding of an arbitrary form is 
difficult to execute. Notably, I-VOP represents a VOP 
obtained through a process of dividing input images into 
blocks, performing DCT for each block, quantizing DCT 
coefficients, and subjecting the quantized DCT coefficients 
to variable -length coding (intra-coding) . 

[0024] With regard to the above-mentioned problem that 

the quantity of codes increases, the quantity of codes can be 
adjusted, because in the MPEG- 4 standard the quantizing scale 
can be set freely. However, the image quality is degraded 
while the quantizing scale is set to a larger value. 
Therefore, if a host computer which includes a converter has 



a reserve of CPU power, inter-frame compression is desirably 
carried out. For this purpose, there is configured a 
converter capable of coding VOPs, including P-VOP (predictive 
VOP) and B-VOP (bi-directional predictive VOP) , for which 
5 inter-frame-compression can be performed. 

[0025] FIG. 2 shows the configuration of a converter 

that converts DV data into MPEG coded data. The 
configuration and operation of the converter have been known. 
In the present invention, coding processes shown in FIG. 2 

10 are classified or divided into basic processes and auxiliary 
processes. The basic processes are minimum coding processes 
to be performed at all times. The auxiliary processes are 
coding processes to be performed only when the converter has 
sufficient time to perform the auxiliary processes. In FIG. 

15 2, thick-frame blocks indicate basic processes; i.e., 

"variable -length decoding," "inverse quantization," "inverse 
DTC," "reconstruction," "DCT," "quantization," and "variable 
length coding." These basic processes are minimum processes 
that are essential when every VOP is I -VOP. Meanwhile, the 

20 remaining processes; i.e., "AC/DC prediction," "inverse 
quantization," "inverse DCT," "motion compensation," and 
"motion information coding" are defined as auxiliary 
processes. Among these auxiliary processes, "AC/DC 
prediction" is used for saving the code quantity of I -VOPs or 

25 P-VOPs, and the remaining auxiliary processes are used for 
inter-frame compression of P-VOPs or B-VOPs. 
2) Coding process in distribution module 



[0026] In the illustrated system, in order to determine 

whether or not P-VOP and B-VOP should be used and to 
determine the use frequency of P-VOP and B-VOP, an average 
coding time for 1 VOP is predicted through measurement of 
5 coding execution times of various processes, and a 

determination is made in accordance with a determined 
algorithm. FIG, 3 is a flowchart illustrating a coding 
process in such a distribution module. 
[0027] Step SI: 

10 As described above, coding processes are divided into 

basic processes and auxiliary processes. 
[0028] Step S2: 

At the startup of the distribution module, an input 
image is coded in the order of IBPBIBPBIBPB . . . , and for each 

15 VOP type the period of time required to process a plurality 
of VOPs is measured. An average processing time required to 
perform each kind of process is thereby measured. I, B, and 
P represent I -VOP, B-VOP, and P-VOP, respectively. 
[0029] In this processing time measurement, processing 

20 time per VOP is measured for Variable -length decoding," 
"inverse quantization," "inverse DCT," and "block 
reconstruction" in a DV decoder; and for "DCT," 
"quantization," "variable -length coding," "AC /DC prediction," 
"inverse quantization," "inverse DCT," "motion compensation," 

25 and "motion information coding" in an MPEG-4 encoder. Thus, 
average processing times of the individual processes are 
obtained. Among these processes, "motion compensation" 



provides different processings between P-VOP and B-VOP, and 
therefore, its processing time greatly changes between P-VOP 
and B-VOP. Therefore, the average processing time of "motion 
compensation" is obtained for each of P-VOP and B-VOP. In 
5 this manner, the coding execution times of the individual 
processes are measured. 

[0030] The time necessary for coding each VOP will now 

be described in more detail. When coding of all VOPs is 
executed by sole use of I -VOP, coding can be executed in the 

10 shortest time. However, when coding is executed by use of P- 
VOP or B-VOP, reference images for performing compensation on 
the basis of preceding and succeeding VOPs must be prepared. 
This necessitates inverse quantization and inverse DCT. 
Therefore, the period of time necessary for coding I -VOP 

15 depends on whether or not P-VOP or B-VOP are used in the same 
VO. The individual parameters necessary for evaluation are 
defined as follows: 

T min : time required for minimum coding 

T ACDC (n): time required for AC/DC prediction of n-th VOP 

20 T IQ : time required for inverse quantization 
T IDCT : time required for inverse DCT 
Tc: time required for motion information coding 
T M (n): time required for motion compensation of n-th VOP 
[0031] Other than T M and T ACDC each of the above- 

25 mentioned values assumes a fixed value which is determined in 
accordance with the specifications of a machine to be used 
and the size of an object image. By contrast, T M is a 



variable value that can be set freely, because even when a 
motion compensation process is interrupted, coding can be 
performed by use of the result obtained immediately before 
interruption. Further, T ACDC is also a variable value, 
5 because determination as to whether or not AC/DC prediction 
should take place can be made for every frame. Specifically, 
T M for P-VOP assumes any value within the range of 0 ^ T M (n) 
2s T M -Prnax; T M for B-VOP assumes any value within the range of 0 
£ Tn(n) ^ TM-Bmax? and T ACDC assumes a value of 0 or T ACDCrnax . 

10 pmax represents a period of time required for performing 
motion compensation for all macro blocks in P-VOP; T M - Bre ax 
represents a period of time required for performing motion 
compensation for all macro blocks in B-VOP; and T AC Dcmax 
represents a period of time required for AC/DC prediction for 

15 1 VOP when the AC/DC prediction is executed. Each of the 

above-mentioned parameters can be obtained by measuring the 
following various processing times and calculating their 
average values . 

[0032] T min : total of processing times of "variable- length 
20 decoding," "inverse quantization/' "inverse DCT," and "block 
reconstruction" in the DV decoder; and "DCT," "quantization," 
and "variable -length coding" in the MPEG- 4 encoder 
T ACDCmax : processing time of "AC/DC prediction" 
T IQ : processing time of "inverse quantization" in the MPEG- 4 
25 encoder 

T IDCT : processing time of "inverse DCT" in the MPEG- 4 encoder 
Tc: processing time of "motion information coding" 
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T M-pmax ; processing time of "motion compensation" for P-VOP 
T M-Bmax : processing time of "motion compensation" for B-VOP 
Notably, the processing time of "motion compensation" is 
measured when "motion compensation" is executed completely. 
5 [0033] The coding time of each VOP is as follows: 

Coding of I -VOP: 

• If P-VOP and B-VOP are not used in other VOPs, 

T(n) = T min + T ACDC (n) (1) 

• If P-VOP or B-VOP is used in other VOPs, 

10 T(n) - T mln +T ACDC (n)+T IQ +T IDCT (2) 
Coding of P-VOP or B-VOP: 

T(n) = T mln +T ACDC (n)+T IQ +T IDCT +Tc+T M (n) (3) 
[0034] Here, the number of occurrences of I -VOPs in N 

VOPs is represented by Nivop; the number of occurrences of P- 
15 VOPs in N VOPs is represented by N PV op; and the number of 
occurrences of B-VOPs in N VOPs is presented by N BV op. In 
this case, the total coding time of N VOPs is calculated as 
follows : 

Ttotal = 2^ T(n)+2 p . VOp T(n)+2 B . vop T(n) 
20 = Nivop(T m i„+TiQ+T ID cT)+2 I . vop T AC Dc(n) 

+ Npvop ( T mln + Tiq + Tidct + Tc ) + 2 p _ VOp ( T ACD c ( n ) + T M ( n ) ) 
+ Nbvop ( T mln + Tiq + Tidct + Tc ) + 2 B _ vop ( T ACD c ( n ) + T M ( n ) ) 
= Nx ( T mln + Tiq + Tidct ) + ( N PVO p + N B vop ) Tc 

+2 all -voP T ACDc(n) +2 p . vop T„(n) +2 B . vop T„(n) (4) 
25 The average coding time for 1 frame at that time is: 

Tave = Ttotal/N = Tmm+TiQ+T^cT+TcCNpvop +N BV op)/N 

+ < 2 a ii-voP T ^Dc(n)+2 p . VOp T„(n)+2 B . vop T M (n) )/N (5) 
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[0035] Since the values of T M and T ACDC are variable, 

each term that includes the operator 2 in formula (5) can be 
freely determined. The value of T M _ Pmax is about several times 
the value of T ACDCmax , and the value of T M _ Bmax is about several 
5 times the value of T M _ Pmax . Therefore, the coding time is 
preferentially allocated to AC /DC prediction, motion 
compensation of P-VOP, and motion compensation of B-VOP, in 
this order. Practically, since the time ratio of AC/DC 
prediction to the entire coding is small, in the following 

10 description, the AC/DC prediction is presumed to be executed. 
[0036] Step S3: 

The kind and use frequency of VOP to be used are 
determined in accordance with a VOP determination algorithm, 
from the average processing time measured in step 2, 

15 [0037] A target average coding time for 1 frame is 1/R, 

where R is a target frame rate. According to formula (5), 
the average coding time when motion compensation is 
completely executed using N pvop P-VOPs in N VOPs is 

Tave = T rain +T IQ +T IDC T+(Tc+T M -Pmax)Npvop/N+T A cDCmax (6) 
20 Therefore, the minimum condition for using N pvop P-VOPs in N 

VOPs is 

1/R as T min +T IQ +T IDCT +(Tc+T M 

-Pmax ) Npvop / N+T ACDCmax ( 7 ) 

Further, when motion compensation is completely performed for 
all micro blocks in P-VOP and B-VOP by use of N pvop P-VOPs and 
25 n bvop B-VOPs in N VOPs, an average coding time is 
Tave = T min +Ti Q +TiDCT+(Tc +T M -Pmax 

)N PV0P /N 

+ ( TC+Tm.^ ) N BVOp /N+T AC DCmax ( 8 ) 



Therefore, a minimum condition for using N pvop P-VOPs and N BVOP 

B-VOPs in N VOPs is 

1/R * T min +T IQ +T IDCT +(Tc+T M 

- Pmax ) ^P VOP / ^ 

+ ( Tc+T M _ Bmax ) N BVOP /N+T ACDCinax ( 9 ) 

5 [0038] Practically, the ratio between VOPs cannot be 

changed freely, and the ratio is determined by changing the 
number of P-VOPs sandwiched between two I -VOPs or the number 
of B-VOPs sandwiched between any combination of I -VOPs and P- 
VOPs (hereinafter referred to as "between two I -VOPs or P- 

10 VOPs"). If the sum of the number of P-VOPs sandwiched 

between two successive I -VOPs and 1 is defined as Np and the 
sum of the number of B-VOPs sandwiched between two successive 
I -VOPs or P-VOPs and 1 is defined as Nb, the number of VOPs 
of each kind included in N successive VOPs is expressed as 

15 follows: 

n ivop = N/NpNb (10) 
Npvop - (Np-l)N/NpNb (11) 
n bvop = (Nb-1 )N/Nb (12) 
Therefore, formulae (7) and (9) are expressed as follows: 
20 1/R ^ T mln +T IQ +T IDCT +(Tc+T M . Pmax )(Np-l)/NpNb 

+ ^ l ACDCroax (13) 

1/R * T mtn +T IQ +T IDCT +(Tc+T M . Praax )(Np-l)/N P Nb 

+ (Tc+T M . Bmax ) (Nb-l)/Nb+T ACDCmax (14) 
[0039] An algorithm for determining appropriate values 

25 of Nbvop and N PVO p is shown in FIG. 4. Inputs T and R of the 

algorithm represent a processing time and a target frame rate, 
respectively, of each coding process. T includes all coding 
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processing times, and is expressed by T={T min , T A cdc# Ti Q , Ti DC t/ 

TC , TM-Pmax » -BraaxJ • 

[0040] First, the determination of formula (13) is 

performed under the conditions that N BVOP =l and N pvop =2 . If 
5 motion compensation can be performed for all VOPs, the 

determination is repeated, while the value of N pvop is doubled 
every time. This determination is continued until the value 
of N pvop reaches a preset maximum value N PV opmax. During the 
determination, if conditions that fail to satisfy formula 
10 (13) are obtained, the value of N pvop is restored to an 

immediately preceding value, and coding is performed by use 
of I -VOPs and P-VOPs. 

[0041] If motion compensation can be performed for all 

VOPs even when the value of N pvop reaches N PV opmaxr N BVOP is set 

15 to 2, and the determination of formula (14) is repeated, 
while the value of N BVOP is doubled every time. This 
determination is continued until the value of N BVOP reaches 
Nfivopmax- During the determination, if conditions that fail to 
satisfy formula (14) are obtained, the value of N BVOP is 

20 restored to an immediately preceding value, and coding is 
performed by use of all kinds of VOPs. At that time, the 
number of I -VOPs decreases as N BVOP is increased. Therefore, 
in order to maintain a constant I-VOP ratio, N pvop is halved 
each time N BVOP is doubled. 

25 [0042] Step S4 : 

Coding is continued in accordance with the kind and use 
frequency of VOP determined in step S3. 
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According to the present invention, a system capable of 
editing and distributing a plurality of images and voices is 
constituted by three kinds of modules; i.e., an editing 
module, distribution modules, and receiving modules. In this 
system, a coding method which enables the highest possible 
compression efficiency can be automatically selected by 
changing the kind of VOP to be employed in accordance with 
the performance of a machine which uses the distribution 
module . 
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